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ABSTRACT 



We report the detection of color gradients in six massive (stellar mass (M star ) > 10 10 M Q ) 
and passively evolving (specific star formation rate (SSFR) < 10 _11 yr _1 ) galaxies at redshift 
1.3 < z < 2.5 identified in the Hubble Ultra Deep Field (HUDF) using ultra-deep HST ACS 
and WFC3/IR images. After carefully matching the different PSFs, we obtain color maps and 
multi-band optical/near-IR photometry (BVizYJH) in concentric annuli, from the smallest re- 
solved radial distance 1.7 kpc) up to several times the H-band effective radius. We find 
that the inner regions of these galaxies have redder rest-frame UV-optical colors (U-V, U-B and 
B-V) than the outer parts. The slopes of the color gradient have no obvious dependence on the 
redshift and on the stellar mass of the galaxies. They do mildly depend, however, on the overall 
dust obscuration (E(B-V)) and rest-frame (U-V) color, with more obscured or redder galaxies 
having steeper color gradients. The z ~ 2 color gradients are also steeper than those of local 
early-type ones. The gradient of a single parameter (age, extinction or metallicity) cannot fully 
explain the observed color gradients. Fitting the spatially resolved HST seven-band photometry 
to stellar population synthesis models, we find that, regardless of assumptions on the metallic- 
ity gradient, the redder inner regions of the galaxies have slightly higher dust obscuration than 
the bluer outer regions, implying that dust partly contributes to the observed color gradients, 
although the magnitude depends on the assumed extinction law. Due to the age-metallicity de- 
generacy, the derived age gradient depends on the assumptions for the metallicity gradient. We 
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discuss the implications of a number of assumptions for metallicity gradients on the formation 
and evolution of these galaxies. We find that the evolution of the mass-size relationship from 
z ~ 2 to the present cannot be driven by in-situ extended star formation, which implies that 
accretion or merger is mostly responsible for the growth of their stellar mass and size. The lack 
of a correlation between the strength of the color gradient and the stellar mass argues against 
the metallicity gradient predicted by the monolithic collapse scenario, which would require 
significant major mergers to evolve into the one observed at the present. 

Subject headings: Cosmology: observations — Galaxies: evolution — Galaxies: formation — 
Galaxies: high-redshift — Galaxies: stellar content — Galaxies: structure 



1. Introduction 



The stellar mass (M star ) of "spheroids", namely elliptical galaxies and bulges of spiral galaxies, mostly 
consists of old stars that formed at high re dshift, e.g. z > 2 (Renzini 2006). The spheroids segregate 
60% of all the stars in the local universe ( Hogg et al. 2002 : Bell et al.1l2003 : Driver et al. 2006), and thus 
the mechanisms that led to their assembly are key to the evolution of galaxies in general. But while there 
is agreement on the age of the stars of the spheroids, how these stars got together and formed the body of 
ellipticals and bulges remains an open issue. 

During the past several years, galaxies at z > 1 with M star and SED similar to those of local early-type 
galaxies have been identified and studied in relatively large numbe rs thanks to the increased availability of 
deep optical and n ear-IR photometry from large-a rea surveys (e.g., 



ea surveys (, 
Kriek et al. 


e.g., inompson et ai. iy 
2006albl: Icimatti et al. 


: 

2008; 


nx et auzuuj : 
Onodera et al. 



2010). More recently, an increasing number of studies seem to show that the number density of massive 
galaxies wi th very low specific star formation rate (SSFR) undergoe s rapid evolution between z ~ 2 and 
z ~ 1 (e.g. jFontana et al. Ibood : lArnouts et al. 1 120071 : Ilbert et al. Cassata in preparation). The physical 
mechanisms responsible for this apparently rapid assembly of passively evolving massive galaxies remain 
unknown. Equally unknown is if this is just the assembly of the stellar bodies of the massive galaxies, or 
if the fraction of stars locked in passively evolving systems is also evolving accordingly. Various forma- 



tion mechanisms, for example, merger (e. gJBrinchmann & Ellis l boOdl: Le Fevre et alfz OOO: 



Benson et al 



2003 



De Lucia et al. 



Croton et al 



2006) 



20061 : iNaab et all 120071 : iNaab & Ostrikeril2009h and feedback (e.g., 
have been proposed to explain the rapid emergence of massive passively evolving galaxies (PEGs) dur- 
ing this redshift range, which is also the cosmic epoch when star formation in the universe is at its peak. 
Such mechanisms would leave distinguishable imprints on the color and stellar population gradients of 
massive PEGs. For example, a major merger (i.e. mass ratio approximately unitary) of gas-rich galaxies 
would for m a spheroid and trigger a bur s ts of central star-fo rmation, which would leave a blue core to the 
spheroid (Me nanteau et all 12001 al . 120041 : iDaddi et al.l I2Q05T) . Or, if massive PEGs mostly assemble their 
masses through dry mergers or mergers that do not induce central star-formation, they would generally not 
have blue cores. Thus, studying color gradients and their implications on the stellar population gradients of 
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massive PEGs is expected to provide important clues on the formation of massive PEGs at z ~ 2. 



Related to th e formation mechanisms is the issue of the subsequent evolution of the massive PEGs . 
Recent work (e.g. JPaddi et alj2005l ; Fruiillo et alJ20o3.l2007l : lvan Dokkum etld1bo08l ; lc~assata et allboiol) 
shows that many massive PEGs at z > 1.5 are, on average, ~5 times smaller and ~ 50 times denser than 
their local counterparts with similar mass. The phys ical mechanisms proposed to explain this a pparently 



dramatic evolution of size incl ude major merger (e.g., Hopkins et al. 2009c; Ivan der Wei et alj|2009f) . minor 



IMOPij 

merger (e.g.JNaab et alJ 2009h. adia batic expansion (e.g ., [Fan et al. 

Hopkins et allboioh . Others (e.g.. [Hopkins et al.l 2009b; Mancini et alJl2010D have suggested that the small 



•op 
>c; 



2008), and mass-to-light gradients (e.g., 



size of some PEGs at high redshift may be due to an observational bias such that the low surface-brightness 
halos surrounding these PEGs are not detected by current near-IR observations; if these missing halos were 
detected, the derived size of high-z massive PEGs would be similar to that of their local counterparts. In 
order to answer the question whether the observed strong size evolution of massive PEGs from z ~ 2 to 
z ~ is physical or not, near-IR observations with high sensitivity are required to measure the color and 
stellar population distributions of massive PEGs to large radius. 



Color gradients in early type galaxies have been known for about thirty years (IFabetil 19721) and widely 



studied in local galaxies (e.gJPeletier et al. Il990al ; Tamura et alJl200ol ; lLa Barbera et al. 2005 



La Barbera & de Carvalho 



2009; iGonzalez-Perez et al.ll201lh . but no information is currently available on color gradients in PEGs at 
high redshift (z ~ 2), because of instrumental limitations on sensitivity and angular resolution, given the 
compact size of such sources, and the lack of spectral coverage of the rest-frame optical SED. Ground-based 
observations suffer from poor resolution and/or wavelength-dependent and unstable Strehl ratio. Sensitivity 
to low-surface brightness regions is also limited due to the high and variable sky background. For example, 
the typical full-width half-maximum (FWHM) of the point spread function (PSF) of VLT ISAAC Ks-band 
images is about 0.5", corresponding to ~4 kpc for a galaxy at z ~ 2. This size is al most 4 times of the av- 
erage effective radius of a PEGs with M star = 10 10 M Q at z ~ 2 (ICassata et al.ll2010i and reference therein). 



Even if upcoming adaptive optics systems reach near-HST resolution in the K band, performance degrade 
rapidly at shorter wavelength so that making robust color maps is not yet feasible. To sample the color and 
stellar population gradients of PEGs at z ~ 2 at the ~kpc scale, a minimum angular resolution of about 
~ 0.1" is required at both optical and near-IR wavelength. Although HST NICMOS-1 and NICMOS-2 
have such required resolution, their small fields of view and low throughput make them inconvenient for 
surveying large sky area and observing distant and faint galaxies. The detailed study on color gradients 
of a large sample of high-redshift early-type galaxies is only now available thanks to the WFC3/IR imager 
on-board of HST. 

In this work we use the HUDF HST/ACS images in combination with recent WFC3 near-IR deep 
images in the same field to measure the color gradients of a sample of massive PEGs at z ~ 2 to about 
10 times their effective radius and inferred corresponding gradients of physical properties of the stellar 
populations. We measure color gradients for these galaxies in a series of concentric annuli from the ACS 
and WFC3 images and fit the spatially resolved SED to stellar population synthesis models to derive the 
corresponding gradients of stellar population parameters (SSFR, age and extinction), looking for trends 
between the color gradient characteristics and the stellar population properties in an attempt to derive clues 
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on their origins. 



Throughout we adopt a flat ACDM cosmology with O m = 0.3, £l\ = 0.7 and use the Hubble c onstant 
in terms of h = H / '100km s _1 Mpc -1 = 0.70. All magnitudes in the paper are in AB scale (lOkdll974l) 
unless otherwise noted. 



2. The Data 



In addition to the HST /ACS and WFC3/IR images in the HUDF, the data used in this paper also include 
panchromatic multi-wavelength photometry obtained as part of the GOODS program, as the HUDF field 
is embedded in the GOODS south field. The long wavelength baseline of the GOODS photometry enables 
us to reliably select PEGs based on photometrically-derived stellar mass and SSFR, while the deep HST 
optical and NIR images allow us to obtain color maps of the galaxies with a resolution of ~ 1 kpc. 

The GOODS south field has been observed with various telescopes and instrument combinations, from 
the X-ray to the s ub-millimeter and radio. Relevant to our analysi s here is the VL T/VIMOS ultra-deep 
U-band imaging JNonino et all l2009h . as well as HST/ ACS BViz (biavalisco et al.ll2004k VLT/ISAAC 
JHK, Spitzer/IRAC 3.6, 4.5, 5.7, 8.0 /im, and Spitzer/MIPS 24 fim imaging. Since the resolution of images 
significantly chang es from optical- to IR-band, we use an object template-fitting software dubbed TFIT 
(|Laidler et al.1 120070 to obtain matched multi-band photometry. TFIT requires position priors and light 
profile templates drawn from a high-resolution image (the ACS z-band image in our work) and accurate 
measures of the PSF for all images with various resolutions. It fits the template of an object, whose resolution 
is now downgraded to that of low-resolution images, to the images of the object in low-resolution bands, 
with the flux in each band left as a free parameter. The best-fit flux in each band is used as the flux of the 
object in the band. TFIT can simultaneously fit several objects that are close enough in the sky so that the 
deblendding effect of these objects on the flux measurement would be minimized. Experiments on both 
simulated and real images show that TFIT is able to measure accurate isophotal photometry of objects to 
the limiting sensitivity of images. The TFIT measured fluxes of bands with resolution lower than the z- 
band resolution, together with the SExtractor measured AUTO flux of BViz bands, are merged to build the 
GUTFIT catalog (Grogin et al. in prep.). 



The ultra-deep ACS images in the HUDF ( Beckwith et al.l I2006T) cover an area roughly equal to the 
footprint of the ACSAVFC FOV in the same four filters as the GOODS ACS program, namely F435W (B), 
F606W (V), F775W (i), and F850LP (z) down to a depth of 29.4, 29.8, 29.7, and 29.0 mag (5a, 0.35"- 
diameter aperture), respectively. We use the publicly available images, which have been rebinned to the 
same pixels scale as the GOODS/ACS mosaic, namely 0.03" /pixel (0.6 x the original ACS pixel scale). 

The WFC3/IR data are from the HST Cycle 17 program GO- 1 1563 (PI: G. Illing worth), which aims 
at complementing the HUDF and the two HUDF05 parallel fields doesch et al. l l2007h with WFC3/IR im- 
ages in Y (F105W ), J (F125W), and H (F160W) of matching sensitivity, ~29 mag feouwens et alJboiol : 
Oesch et al.l 120100 . Here we use only the first epoch of the images, released in September 2009, which 
includes 18 orbits in Y, 16 orbits in J, and 28 orbits in H. We have carried out our independent reduction 
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of the raw data, and after rejecting images affected by persistence in the J band, our final stacks reach la 
surface brightness fluctuations of 27.2, 26.6 and 26.3 AB/" 2 in the three bands, respectively, over an area 
roughly equal to the footprint of the WFC3/IR camera (2.1 " x 2.1 "). We have drizzled the WFC3 images 
from their original pixel size of 0.121"x 0.135" to 0.03" per pixel to match the scale of the GOODS and 
HUDF ACS images. 



3. Passively Evolving Galaxies at z ~ 2 in HUDF 

We select passively evolving galaxies based on their SSFRs estimated by fitting the GOODS GUTFIT 
12-band photometry to stellar population synthesis models. During the fit, the value of the redshift param- 
eter is set to either the spectroscopic redshift, when available, or the photomet ric redshift, which we have 



separately measured using the PEGASE 2.0 (|Fioc & Rocca-V olmerange 1997) templates. Another set of 
photometric redshifts is also available in the GOODS-S field iDahlen et alJ boiok and we find that results 



from both sets are in excellent agreement; both sets achieve <r(Az/(l + z)) ~ 0.04 in the redshift range 
considered here. For the SED fitting we use the stellar population synthesis models of Chariot & Burzual 
2009 (CB09), with Salpeter IMF and lower and upper mass limits of 0.1 and 100 M Q , respectively. We 
also use the e~*/ T -model (r-model) to parametrize the star formation history of the galaxies. The free pa- 
rameters that are found by the fitting procedure are the stellar mass M star , th e dust reddening E(B-V), the 
r parameter and the age t of the stellar populations. We use the Ca lzetti Law ( Calzetti et al. 1994 . 200oh to 



model the obscuration by dust and the prescription of llvladau ( 1995 ) to account for the cosmic opacity by HI 



Finally, we average the best-fit model star-formation history over the last 100 Myr to derive the current SFR 
of the galaxy. We estimate the 1 — a error bars of the parameters of the best-fit model from Monte-Carlo 
simulations, where we perturb the photometric measures using Gaussian variates with variance set equal to 
the photometric errors and re-run the fitting procedure 200 times. 

We define passively evolving galaxies in the redshift range 1.3 < z < 3.0 as those whose specific 
star-formation rate satisfies the relation 

SSFR=-^p<10- 11 yr- 1 , (1) 

J-Vlstar 

and restrict our samples to only include massive systems, namely those with M star > 10 10 M . Among 53 
galaxies with M star > 10 10 M and 1.3 < z < 3.0 in the HUDF, 11 galaxies have SSFR< 10~ n yr -1 . We 
exclude two of them from our sample, because we are interested in studying color gradients of early-type 
galaxies, while these systems have irregular morphology and not well-defined centers, possibly implying 
ongoing merging events. We exclude another two galaxies, because they have extremely faint NIR fluxes, 
in fact they have negative J, H and IRAC fluxes, which result in large uncertainties in their photometrically- 
derived physical properties. Finally, an additional galaxy with an obvious spiral morphology has also been 
eliminated from the sample. Although the best-fit SSFR of this galaxy is 10~ li n yr -1 , the probability 
distribution function (PDF) of this SSFR measure has two peaks with similar probability density, one around 
•j_q-ii.11 y r i anc j ^ other io~ 10 - 23 yr -1 , implying a substantial probability for this source to be a star- 
forming galaxy. 
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After these exclusions, the final sample includes 6 galaxies, whose GOODS ID, c oordinates, redshift, 
SED-fitting parameters and H-band effective radius, measured by lCassata et al.1 (120101) . are shown in Table 
[T] Five out of the six gala xies in the sample satisfy pBzK color-color criterion for passively evolving 
galaxies at 1.4 < z < 2.3 (IDaddi et all 120041) or the analog VJL criterion for redshift 2 < z < 3 (VJL, 
Guo et al. in preparation). The last galaxy, 24626, resides just below the pBzK selection window in the 
(B-z) vs.(z-K) color-color plane, most likely because its spectroscopic redshift, z = 1.31, is outside of the 
targeted range of the pBzK crite rion. Four galaxie s have spectrosco pic redshift, 22704 and 23555 from 



Cimatti et a l. (2008), 24279 from lDaddi et all (120051) and 24626 from lVanzella et all (12008). 



Galaxy 23495 has a counterpart in the Chandra Deep Field South 2-Megasecond catalog (ILuo et al. 
2008). It has X-ray luminosity 3.8 x 10 43 er ^/s and 5.6 x 10 43 erg/s i n the soft and hard band , respectively. It 



Luo et al. 



is not, however, detected in the VLA map by Kellerma nn et all (|2008f ) and lMiller et all (|2008T) . Galaxy 24626 
also has a counterpart in the catalog of 



(|2008f) . but it only has a marginal (S/N ~ 1.3) detection in 
the soft band with X-ray luminosity 2.7 x 10 41 erg/s and none in the hard band. We re-investigate the two 
sources with the newly released Chandra Deep Field South 4— Megasecond imageQ and find similar results. 
Other four galaxies have no detection in both bands, either individually or stacked, in the 4— Megasecond 
image. Finally, all our sample galaxies have no detection at 24 //m down to a la limit of 5 /iJy, consistent 
with predictions for passively evolving galaxies at z ~2 (IFontana et al.l l2009). 

Figure Q] shows the images of the sample galaxies in the z- and H-bands, as well as their (z-H) color 
composites. The z-band images have their original resolution (~0.12 " ), while the resolution of the (z-H) 
color images is that of the H-band images, after PSF matching (see later). Both the z-band and H-band 
images show that all the sample galaxies have spheroidal, early-type morphology, while (z-H) color maps 
reveal both analogies and differences among them. All galaxies have a red center and blue outskirts. Galaxy 
24626 has the most well-defined red center and the clearest color gradient. Galaxy 23555 and 24279 also 
have well-defined red centers, but their outskirts are observed at relatively lower S/Ns and the resulting color 
gradient is not as clear as that of Galaxy 24626. In the remaining three galaxies, the location of the red stellar 
populations is slightly off-center, with the distance between the centroid and the red center comparable to 
the H-band half-light radius of the galaxy. After re-sampling both z-band and H-band images to smaller 
pixel scale (0.01" /pixel) and re-registering images, we still find such off-center red cores. Therefore, we 
rule out the sub-pixel image registration issue as the reason of the off-center cores. Instead, we suspect the 
asymmetry in the cores of our empirical PSFs could be the reason. However, the use of annuli photometry 
with size of a few FWHM largely reduces the influence of the asymmetry of PSFs so that it would not impact 
our results, as our later test in ^4. II shows. Since we aim at measuring the color gradients up to ~10 times 



of the H-band half-light radius, it is still reasonable to consider that these galaxies, too, have red centers. 



'http://cxc.harvai'd.edu/cda/Contrib/CDFS.html 
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Table 1 : The Physical Properties of massive PEGs in our sample 



GOODS ID 


RA 


DEC 


redshift^ 


M star 


SSFR 


E(B-V) 


z 






(J2000) 


(J2000) 




Log(M Q ) 


Logtyr" 1 ) 






kpc 


19389 


53.1357303 


-27.7849320 


1.345(p) 


10.18 ±0.11 


-11.98 ± 1.19 


0.10 ±0.04 


l.OZ 


1.02 


22704 


53.1537988 


-27.7745867 


1.384(s) 


10.70 ±0.01 


-14.55 ± 1.00 


0.15 ±0.07 


0.2Z Q 


0.50 


23495 


53.1584550 


-27.7739817 


2.422(p) 


11.07 ±0.05 


-11.98 ± 1.54 


0.25 ±0.06 


0.2Z© 


<0.38 


23555 


53.1588102 


-27.7971545 


1.921(s) 


10.82 ±0.04 


-11.98 ±0.07 


0.00 ±0.01 


1.0Z Q 


0.44 


24279 


53.1630047 


-27.7976545 


1.980(s) 


10.63 ±0.07 


-12.39 ±0.34 


0.00 ±0.01 


0.2Z© 


0.37 


24626 


53.1651596 


-27.7858696 


1.317(s) 


11.10 ±0.04 


-11.15 ±0.05 


0.10 ±0.03 


O.2Z 


3.69 



"The number in brackets indicate the quality of redshifts: p stands for photometric redshift, s for spectroscopic 
redshift 

4. Annular Photometry of Massive Passively Evolving Galaxies 

We measure azimuthally-averaged color gradients for the six galaxies by carrying out aperture-matched, 
multi-band annular photometry. A problem we face when implementing this procedure was how to properly 
define the set of concentric apertures for each galaxy in such a way that it optimally samples the color gradi- 
ent. After some experimentation with automated procedures to determine the annuli based on the effective 
radius (typically in the H-band), however, we resort to set them manually based on a visual inspection of 
the (z-H) color images. We test the robustness of our result against the choice of the apertures by perturbing 
them around the visually determined positions and also by choosing equally-spaced annuli simply based 
on the (visually established) extent of the color gradient. While variations at the level of 10-15% were 
observed, in no case these would change our results and conclusions. The chosen annular apertures for each 
galaxy are shown as white circles in Figure Q] Obviously, while with a sample of six galaxies it is rela- 
tively simple to manually set the concentric apertures, dealing with large samples will require an automated 
procedure to be developed. We plan to come back to this problem in a future paper. 

Before carrying out the multi-band photometry an additional step was necessary, namely matching the 
angular resolution of the images in all the filters to that of the H band PSF to eliminate any artificial color 
gradients introduced by differences in image quality. This varies from FWHM ~0.12" in the BViz bands 
to FWHM ~0. 1 8" in the YJH bands. We will discuss this procedure in the next section. 



4.1. PSF Matched Images 

We measure the PSF in each band from seven well exposed and non-saturated stars whose SExtractor 
stellarity index in the i-band is larger than 0.98. These stars are used as input to the IRAF DAOPHOT 
package to generate an average PSF image in each band. DAOPHOT fits an analytical profile to the central 
region within ~ 1 x FWHM and adds the averaged outskirts of the stars to the best-fit profile. Once we build 
the PSF for each of the BVizYJ bands, we use the package IRAF PSFMATCH to calculate a smoothing 
kernel and convolve each image to match its resolution to that of the H-band. 
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Fig. 1. — The montage of six massive passively evolving galaxies in our sample. Each row shows two 
galaxies. For each galaxy, panels from left to right show the HUDF HST/ACS z-band, WFC3/TR F160W, 
and z-H color images. The GOODS v2.0 ID of each galaxy is labeled in images. The z-band and H-band 
images have different resolution (PSF FWHM of 0.12" and 0.18", respectively), but the z-H color images 
are generated after matching the z-band PSF to that of H-band (see ^4. II for details). The white concentric 
circles outline the annuli used to measure the multi-band annular photometry. For each galaxy, a white line 
shows the scale of 1". 
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before PSF matching after PSF matching 




0.0 0.2 0.4 0.6 0.8 1 .0 1 .2 0.0 0.2 0.4 0.6 0.8 1 .0 1 .2 

radius (arcsec) radius (arcsec) 

Fig. 2. — The fractional encircled energy of PSFs of 7 bands, before (left) and after (right) our PSF matching. 



We test the effectiveness of PSF matching by comparing the fractional encircled energy of each PSF 
before and after the procedure. Figure [2] shows that after matching, the PSF in all bands have identical 
profile, especially within the central region (roughly <0.4" ), where the gradient is steepest. There are some 
very small fluctuations in the wing (0.4" to 1.0" ) of the Y and J band PSFs, due to differences in the airy 
rings of the original PSF These, however, are smaller than 2% and thus we neglect them, since they will not 
cause any detectable bias in our analysis. 

In addition to testing the homogeneity of the matched PSF, we also verify the effectiveness of the 
PSF matching procedure in measuring realistic color gradients by means of simulations that, at the same 
time, also give us information on the effects of the PSF variations across the field. We generate a model 
galaxy with given colors and then inserted it into the images in seven different positions In practice, we 
use each of the seven stars that went into building the average PSF as position-dependent PSF themselves, 
after appropriate normalization. For the model galaxy we use a Sersic spheroid with index n = 2 and 
effective radius R c ff = 0.5 kpc in the H-band, and assigned (X-H) color, where X is one of BVizYJ. We 
convolve the model image with the seven PSFs in each band and inserted the result in the corresponding 
image in proximity to the star. Then, we apply the PSF-matching procedures to the images and measured 
the color gradient of the galaxy at its seven different potions as if these were real measures. Figure [3] shows 
the difference A(X — H) = (X — H) out — (X — H)i n between the "observed" color gradient and the input 
one in each band at each of the seven positions. As the figure shows, there is no evidence of significant 
systematic bias introduced by the PSF-matching procedure, with all the deviations consistent with having 
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Fig. 3. — The effect of PSF-matching in the measure of color gradients. The figure shows the difference 
A(X — H) = output — input, where X is one of BVizYJ, between the input color gradient of a model 
galaxy and the output one, measured from the real images after convolving the model with the PSF of 
each image, inserting the result into the image, applying the PSF-matching procedure and measuring the 
"observed" color gradient. To simulate the effects of a position-dependent PSF, we do not use the average 
PSF of each band, but rather each of the seven stars (after appropriate normalization) that we use to create the 
average PSF. Thus, each point represent the color gradient of the same model galaxy observed at different 
position in the HUDF FOV, while the squares and error bars show the mean and standard deviation of the 
points in each band. 
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random nature. The case of the B band is the one with the largest deviations, but while the scatter of 
A(B — H) is comparatively large, the mean difference between the output and the input colors at radius less 
than 0.6" does not significantly deviate from zero. In the annulus between 0.6" and 0.9", the simulations 
suggest that we underestimate the (B-H) color by ~ 0.4 magnitude, although the B-band flux of PEGs in our 
targeted redshift range is so faint that the background fluctuation, rather than the mismatching of PSFs, likely 
dominates the uncertainty of the color measurement. In practice, however, this has no direct consequence 
in our analysis, since we do not use B-band derived color gradients. In conclusion, our test shows that the 
PSF-matching procedures is effective in recovering the color gradient and introduces no significant bias to 
our measurements. 



4.2. The Reliability of the Annular Photometry: the Probability Distribution of Photometric 

Redshift 

We also conduct a further test of the robustness of results derived from the multi-band annular-aperture 
photometry by comparing the photometric redshift derived from each annulus to that measured from the 
integrated photometry. In principle, the redshift of an annulus should be the same as that of the whole 
galaxy. If large deviations are encountered this flags potential bias in results derived from the annular 
photometry, especially for the most outer annulus, where the S/N in the bluer bands is significantly lower 
than the redder ones and, as we have seen for the B band, other systematics can affect the measures. 

Figure |4] shows the probability distribution function of the photometric redshift measured with the HST 
BVizJH photometry for the annuli and for the whole galaxy for each of our sources. The figure also plots 
the photometric redshift of the galaxies derived from the integrated GUTFIT photometry, as well as the 
spectroscopic redshift if available. Generally, there is good agreement between the photometric redshift of 
the annuli and that of the whole galaxy, with the differences between the peaks of the distribution function 
of the annuli and the whole galaxy photometric or spectroscopic redshift being typically Az/(1 + z) < 
0.05. Exceptions are two of the annuli of galaxy 24626, which deviate from the spectroscopic redshift by 
Az/(1 + z) ~ 0.08, and the outermost annulus of galaxy 23555, which differs from both the spectroscopic 
and photometric redshifts (which agree very well with each other) by Az/(1 + z) ~ 0.12. 

Overall, the agreement between the annuli's photometric redshift and the spectroscopic or photometric 
redshift of the whole galaxies is typical of this types of measures, with no indication that fitting of the 
observed SED of the annuli to stellar population synthesis models to derive the properties of the stellar 
populations might be affected by systematics or other problems. 

Finally, we wish to point out that the availability of resolved multi-band photometry of sub-structures 
with more homogeneous color distribution than the whole galaxy provides a powerful means to improve 
the photometric redshift measurements, as well as to investigate the reason behind catastrophic failures. 
Although the redshift probability distribution of the individual sub-structure does, in general, deviate from 
the true redshift due to random errors, the combined probability distribution, i.e. their product, is generally 
closer to the true redshift and more sharply distributed than that of the whole galaxy, because the simpler 
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Fig. 4. — The probability distribution function of photometric redshift measured (from the HST BVizJH 
photometry) in concentric annuli around our sample galaxies compared to that of the galaxies as a whole, 
as well as to their spectroscopic redshift, when available. The concentric annuli, from the center to the 
outskirts of each galaxy, are plotted with the red, green, blue, violet, cyan, light brown and gray curves. 
The combined probability, i.e. the product of that of each annulus, is also plotted with a black solid curve. 
The black dashed curve shows the probability for each galaxy as a whole. The solid vertical line shows the 
spectroscopic redshift (when available), while the dashed vertical line shows the photometric redshift of the 
galaxy measured using the 12-band GUTFIT (integrated) photometry. 
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case of the homogeneous colors of the sub-structures is better described by the stellar population synthesis 
models than the more complex case of the generally much larger color dispersion inside the whole galaxy. 
This is illustrated in Figure [4] which shows that the peak of the combined redshift probability distribution 
of each galaxy is closer than the individual ones to the spectroscopic or the (12-band) photometric redshift, 
with typical deviations Az/(l+z) < 0.03. For example, although the annuli redshift probability distribution 
of galaxy 24626 show relatively large deviations from the spectroscopic redshift, the combined distribution 
deviates only by Az/(l+z) ~ 0.02, a more accurate estimate than that of the 12-band photometric redshift, 
which has Az/(1 + z) ~ 0.06. We plan to return on this technique using data from the HST CANDELS 
(Cosmic Assembly Near Infra-red Deep Extragalactic Legacy Survey) program (co-PIs: Sandra Faber and 
Henry Ferguson), which in portions of the survey area will include photometry in two additional filters, 
F814W and F998W, in addition to those discussed here. This will provide even more accurate estimates of 
photometric redshift and stellar population parameters. 



5. Color Gradients in Massive Passively Evolving Galaxies 




r/ r eff, H 



Fig. 5. — The rest-frame B-V (top), U-B (middle) and U-V (bottom) color gradients of the six sample 
galaxies. Each galaxy gradient is color and symbol-coded as labeled in the bottom panel. Also shown are 
the IDs of the galaxies in the publicly released GOODS v2.0 source catalog. 
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To investigate possible dependence of the color gradients of the z ~ 2 PEGs with other integrated 
physical properties of the galaxies and also to compare them to those of local early-type galaxies, we 
interpolate the observed photometry in the annuli to the rest-frame U , B and V band and then obtain the 
corresponding (U-B), (U-V) and (B-V) colors (e.g. see IPahlen et all ( 2005 )). Figure [5] shows the color 
gradients of the six massive PEGs, where the rest-frame colors of the annuli are plotted against the annulus 
radius expressed in unit of the H-band half-light radius (R e ff,H)- For five galaxies the available angular 
resolution and sensitivity allow us to measure the color gradients from ~ 1.5 x R^h to ~ 8 x R^h- For 
galaxy 24626, due to its much larger size (in the H band we measure R^h ~ 3 kpc), we are able to follow 
the color gradients down to a much smaller radius, ~ 0.5 x R^h- 

To the extent that our sample is representative of early-type galaxies at z ~ 2, it appears that these 
systems have negative color gradients in all the three colors that we consider, in the sense that stellar popu- 
lation in these galaxies becomes bluer with increasing separation from the center. This property can already 
be inferred from a visual inspection of the (z-H) color images shown in Figure [T] where all galaxies exhibit 
red cores and blue outskirts. 

The colors of two of the galaxies appear invert the blueing trend at large radii, i.e. their color gradient 
shows an upturn to the red at R/R c g h ~ 3^1. Galaxy 23555 exhibits the red upturn in both the (B-V) 
and (U-V) color gradients. The photometric redshift probability distribution of the outermost annulus of 
this galaxy (see £14.21) shows a relatively large deviation from its spectroscopic redshift, suggesting that the 
photometry of this area of the galaxy is subject to some systematics. A visual inspection of the the H-band 
image reveals that this galaxy resides in a relatively dense environment, with a luminous, large companion 
and a bright star located nearby. Low-surface brightness H-band light from these sources is very likely 
contaminating the outermost annulus of the galaxy. Galaxy 24626 has upturns in the (U-B) and (U-V) color 
gradients. Although there are no large or bright sources nearby, a few faint ones are located close to its 
outermost annulus. These sources are also more extended in the near-IR bands than at the optical ones, and 
may significantly contribute red light to the outskirts of the galaxy. 

Our findings of red cores and blue outskirts in massive PEGs at z ~ 2 are in apparent contradiction of 
what reported by llvlenanteau et aL ( 2001a . 2004 ). who also find that a large fraction (>30%) of spheroidal 
galaxies at z ~ 0.5 have strong internal color variations, but in most of their cases the cores appear bluer than 
the surr ounding areas, suggesting that blue cores are common in z ~ 0.5 elliptical galaxies. iMenanteau et al. 



(1200 lbl) even concluded that most (~60%) of their spheroids formed at z < 2. Regardless of the difference 
of targeted cosmic epochs between their and our works, different sample selection criteria could be the main 
reason of the apparent discrepancy. Our g alaxies are selected with both e arly-type morphology and very low 
SSFR determined by SED-fitting, while Menanteau et al. ( 2001a . 2004) only selected galaxies with E/S0 
morphology in HST I F814W images, in dependent of their potential star-formation activity. We also note 
the a recent work by iGargiulo et al.1 (|201ll ) also reported that 50% of their sample of 20 early-type galaxies 
at z~ 1.5 has significant radial color variation, with five with red cores and five with blue cores. Their sample 
was also selected through morphology, mainly based on the visual inspection of HSTI 'ACS F850LP images 
and further cleane d by removing source s with Sersic indexn<2 or clear irregular residuals resulting from 
light profile fitting ISaracco et al.l (120 lOl) . It is likely that the slope of color gradient (negative or positive) 
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has r elation with the s t ar-formation acti vity of galaxies, even t hey all have early-type morphology. Besides, 
both lMenanteau et al.l (l2001aLl2004h and bargiulo et all d201 lb also found a significant fraction (40%~50%) 
of their galaxies to have red cores as ours. However, our sample only contains six galaxies and cannot allow 
to carry out a good statistical analysis to compare with them. The upcoming CANDELS will provide much 
larger samples to evaluate the fraction of red cores in early-type galaxies at z~2. 

We investigate the dependence of the color gradients on the integrated properties of the stellar pop- 
ulations of the galaxies. Figure [6] shows the slope AC/Alog(R) (C and R are the color and the radius) 
of the color gradients as a function of redshift, stellar mass M star > color excess E(B-V) as a proxy of dust 
obscuration, and the global rest-frame (U-V) color of the galaxies. The properties of the stellar populations 
have been measured from fitting the 12-band GUTFIT photometry of the whole galaxies to spectral popu- 
lation synthesis models, as described in $3] We find that the slopes have a mild dependence on the the dust 
extinction E(B-V), in the sense that galaxies with higher dust obscuration tend to have steeper color gradient 
(larger slopes). At face value this seems to suggest that the origin of color gradients is somehow related to 
the dust content of the galaxies. We also find that slopes have a weak dependence on the global rest-frame 
(U-V) colors of galaxies, with redder (U-V) colors corresponding to steeper color gradients. No dependence 
of the slopes on redshift and M star is could be observed. 

We also compare the slopes of the color gradie nts of the z ~ 2 galaxies with that of local ellipticals 



(dashed lines). The local slopes were measured by IWu et al.l (|2005r) . who studied the color gradients of a 
sample of 36 nearby early-type galaxies from the Early Data Release of the Sloan Digital Sky Survey and 
from the Two Micron All Sky Survey. The slopes of the z ~ 2 galaxies that have little or no dust extinction 
are similar to those of the local galaxies, while the z ~ 2 galaxies with more pronounced obscuration have 
steeper color gradients. The color gradients of local elliptical ga l axies are generally interpreted as evidence 



Tamura et al. 



2000; 



Wu et al. 



20051: lLa Barbera & de Carvalhol l2009). We will 



of metallicity gradients (e.g., 
investigate the origins of the color gradients in the z ~ 2 galaxies in next two sections. 



6. Variation of Single Parameter as the Origin of Color Gradients 

In view of the analysis of the color gradients with SED fitting to spectral population synthesis models to 
understand their physical origin, in this section we investigate whether it is plausible that the radial variation 
of one single parameter can be primarily responsible for them. In other words, the observed color gradients 
can, in general, be explained to the radial variation of age, dust obscuration and metallicity of the stellar 
populations, either individually or in combination. Here we constrain the radial gradient of any one of these 
parameters needs to be, while keeping the others constant, for it to be solely responsible for the observed 
color gradients and discuss the implications. We assume simple parametrization for the dependence of the 
selected parameter with the radius, and, for simplicity, we only use a single stellar population (SSP) model 
from CB09 as representative of the SED of our passively evolving galaxies. 

First, we study the possibility that an age gradient is responsible for the observed color gradients, while 
keeping metallicity and dust obscuration constant with radius. We model the age gradient as 
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Fig. 6. — The slope of the (B-V) (bottom row), (U-B) (middle row) and (B-V) (top row) color gradients of 
the six sample galaxies as a function of redshift, stellar mass M sta r> E(B-V) as a proxy for dust obscuration, 
and global rest-frame U-V color. The properties of the stellar populations of the galaxies are derived from 
fitting the 12-band GUTFIT photometry to spectral population synthesis models, as explained in the text. 
The dotted line in each panel shows the slope of the color gradients of local elliptical galaxies measured by 
J2005L 



Wu et al. 
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Fig. 7. — The predicted rest-frame (B-V), (U-B) and (U-V) color gradients from single stellar population 
models if the age of the dominant stellar population is the only parameter that varies as a function of radius in 
the galaxies (thin lines). The observed colors are plotted as black symbols with error bars. We only plot the 
predictions for the case of solar metallicity and zero dust extinction in this figure. Blue, green and red lines 
show the models in which the age at the center has been set at 3, 2 and 1 Gyr, respectively. For each color, dif- 
ferent line patterns show different age gradients models, i.e., from top to bottom, Alog(age)/Alog(R/R c fj ) 
=-0.2 (solid), -0.4 (long dashed), -0.6 (short dashed), -0.8 (dotted) and -1.0 (dashed-dotted). The think black 
line shows the prediction of age gradient that best reproduces the observations. See the text for details. 
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dt = Alog(t)/Alog(R/Refr), where to is the age at the center, and thus a model is fully described by a set 
of four values of the parameters to, at, Z, E(B-V). Rather than finding best-fit models we let the parameters 
vary within a four dimensional (4-D) grid chosen so that the results model predictions for the color gradients 
bracketed the observed ones. The 4-D grid is defined by Z = O.2Z , Z and 2.5Z , E(B-V) = 0.0, 0.05, 
0.10and0.15, to= 1, 2 and 3 Gyr, a t = -0.2, -0.4, -0.6, -0.8 and -1.0. Given a point in the grid, i.e. the vector 
(to, at, Z, E(B-V)), we compute the color gradients of (U-B), (U-V) and (B-V) as a function of radius. We 
also calculate the x 2 as a metric to characterize the goodness of a model in describing the observations, 
defined as: 



where C Q b s ,i and ac,obs,i are the observed color and its uncertainty at a given radius, C Pt i the predicted color 
at the radius, and N b s the total number of observed colors at all radii. 

Figure |7]shows the model color gradients for (B-V), (U-B) and (U-V) compared them with the data. For 
simplicity, we only show the case of Z=Z and E(B-V)=0 in the plot. Blue, green and red lines correspond 
to to = 3, 2 and 1 Gyrs, while the different line patterns show the cases of at'. -0.2 (solid), -0.4 (long dashed), 
-0.6 (short dashed), -0.8 (dotted) and -1.0 (dashed-dotted). 

The top panel shows that the (B-V) color gradient is best approximated by the solid green line, i.e. 
to = 2 Gyr and ctt=-0.2 (x 2 = 1.23). But the middle and bottom panel show that this set of parameters 
overestimates the (U-B) (x 2 = 7.73) and (U-V) (x 2 = 7.12) color gradients. The (U-B) color gradient is 
actually best approximated by the green long-dashed line (to = 2 Gyr and ai=-0.4, x 2 = 5.42), while the 
(U-V) one by the long-dashed blue line (t = 3 Gyr and a t =-0A, \ 2 = 5.71). 

Even when we change value of the Z and E(B-V) within the preassigned range we still cannot find a 
combination of at and to that can simultaneously provide a good description for all the three color gradients. 
The parameter set that best reproduces the (U-B) color gradient is (to, at, Z, E(B-V)) = (3.0, -0.6, O.2Z , 
0.15) with x 2 = 5.31, that of (U-V) by (3.0, -0.4, Z , 0.0) with x 2 = 5.71, and that of (B-V) by (1.0, 
-0.2, 2.5Z , 0.15) with x 2 = 1-10. We also determine which model minimizes the combined x 2 , namely 
Xij-B + Xu-v +Xb-v Th^ s m °del, shown by the black lines in the figure correspond to (to, at, Z, E(B-V)) 
= (3.0, -0.2, O.2Z , 0.0), with xI-b = 5 - 58 > Xu-v = 5 - 89 and Xb-v = L19 - 

Since there always is a different combination of the model parameters in our chosen 4-D grid that 
brackets different set of observed colors, we conclude that no combination of a t and to with constant Z and 
E(B-V), i.e. age alone, can simultaneously explain the three observed gradients. 

We also repeat the same analysis for the case of a metallicity gradient and obscuration gradient to see 
if either one of these could be responsible for the color gradients, finding similar negative conclusions. 

For the case of the metallicity gradient, the parameter sets that best fit the (U-B), (U-V) and (B-V) color 



gradients are (Z , a z = Alog(Z)/Alog(R/R eff ), t, E(B-V)) = (2.5Z , -0.4, 1.0, 0.0) with x 2 = 5.30, 
(Z , -0.6, 1.0, 0.15) with x 2 = 5.44 and (2.5Z , -0.8, 1.0, 0.15) with x 2 = 1-08. The corresponding pa- 
rameter set that results in the minimum combine x 2 is (Z , -0.6, 1.0, 0.15) with X2,u-B = 5.39, X2,u-V = 




(2) 
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5.44, X2,B-v = 1-18. For the case of the obscuration gradient, the parameter sets that best reproduce the 
(U-B), (U-V) and (B-V) color gradients are (E(B - V) , a E{B _ v) = AE(B - V)/Alog(R/R eff ), t, 
Z) = (0.15, -0.08, 1.0, O.2Z ) with x 2 = 5.49, (0.15, -0.04, 1.0, O.2Z ) with x 2 = 6.70 and (0.15, - 
0.08, 1.0, Z ) with x 2 = 1-22. The one that minimizes the combined x 2 is (0.15, -0.06, 1.0, O.2Z ) with 
X2,u-b = 5.53, X2,u-v = Q-99,X2,b-v = 2.12. 

In conclusion, unless the assumption of SSP is grossly inadequate for describing the rest-frame UV/Optical 
SED of our passively-evolving massive galaxies at z ~ 2, it seems unlikely that the radial dependence of 
only one parameter among age, metallicity or dust obscuration (with the other two being constant) can be 
responsible for the observed color gradients. These must originate from the interplay of the gradients of age, 
extinction and metallicity. 



7. Stellar Population Gradients in Massive Passively Evolving Galaxies 



We investigate the nature of the observed color gradients by fitting the HST 7-band photometry (ACS 
BViz and WFC3/IR YJH) in the annular apertures defined before (see Figured]) to the CB09 spectral pop- 
ulation synthesis models to derive the radial dependence of stellar mass, specific star-formation rate, age 
and dust obscuration of the stellar populations in the annuli. We approximate the star formation history with 
an exponentially declining model (e~ t//r ), where the age of the stellar populations is the time t from the 
beginning of the star formation to the time of observation. During the fitting, the redshift of each annulus 
is kept fixed to the spectroscopic redshift or to the photometric redshift of the whole galaxy measured from 
the GUTFIT 12-band photometry. 

While this procedure yields robust estimates of th e stellar mass, dust obscuration, age and metallicity 



suffer from larger uncertain ties and degeneracies (e.g. JPapovich et al.ll200ll ; IShapley et al.ll2001uLee et al. 



2010l : lMaraston et al.ll2010f) . The degeneracy between age, metallicity and dust obscuration is partially bro- 



ken if rest-frame infrared photometry is avail able, as shown by several authors (|de Jongll 19961 : ICardiel et al. 



2003 



MacArthur et al. 



20041 : IWu et all 120051) . Unfortunately, high-angular resolution photometry for our 



galaxies is limited to rest-frame UV and optical wavelengths, and thus we cannot effectively separate the 
role that each parameter plays in the observed color gradients. To gain some insight, however, we can make 
some simplifications and reduce the number of free parameters. Instead of letting the metallicity free to 
vary in each annulus during the fit, we set it according to one of the following three assumed power-law 
metallicity gradients: (1) flat, with logarithmic slope Alog(Z)/ Alog(R) = 0.0; (2) the metallicity gradient 
of local early-type galaxies, with Alog(Z)/Alog(R) = —0.25 (IWu et al.ll2005l): (3) the gradient predicted 
by the monolithic collapse model, with Alog(Z)/Alog(R) = —0.5 (lCarlberglll984l) . The latter model is 
meant to represent the case where the z ~ 2 galaxies have formed "in situ" at some epoch prior that of 
observation through some relatively rapid process. 

In the local universe, elliptical galaxies have very little dust obscuration and their stella r populations are 
essen t ially coeval in the sense that their age spread is s mall compared to the mean age (e.g. jTamura & Ohta 



2004; 



Wu et al 



20051 : lLa Barbera & de Carvalhdl2009l) . The situation can be very different at z ~ 2. The 




Fig. 8. — The HST 7-band photometry (ACS BViz and WFC3/IR YJH) of the sample galaxies in the annular 
apertures discussed in Section 4. The curve for each annulus is color and symbol-coded as red star, green 
diamond, blue triangle, violet square, and cyan circle in going from the center to the outskirts of each galaxy. 
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universe is only 3.5 Gyr old at this time, and thus the approximation of coevality is almost certainly 
no longer valid, since this time is comparable to that required to make a galaxy develop an early-type 
SED following the cessation of star formation. Furthermore, we do not understand the mechanisms of dust 
destruction well enough to make robust predictions on the dust content of early-type galaxies at z ~ 2. 
Dust is expect ed to disappea r on a time-scale of ~ 10 8 years after the end of star formation, but this is not 



observed (e.g., Draine 2009, and reference therein). Thus, we study the more general case where both dust 
obscuration and age are left as free parameters. We will discuss the case of no dust in our analysis later. 

Figure [9] plots, for each galaxy, the gradients of E(B-V) and age from the fits expressed as the ratio 
between the value at center and that in each annulus for each of the three assumptions on the metallicity 
gradient. The error bar for each annulus is the standard deviation of the best-fit values from 200 realizations 
from Monte Carlo bootstrap simulations. The figure also shows the average gradient of each parameter and 
its best-fit slope a = AP/Alog(R), where P is either E(B-V) or log(age), R is the radius, and the average 
includes all sample galaxies but 24626. The best-fit slope and its la uncertainty are plotted in the figure as 
black solid and dashed lines, respectively. 

Galaxy 24626 is excluded from the average, because, as Figure [9] shows, its gradients of dust obscu- 
ration and age are very different from those o f the other galaxies . Its half-light radius and Sersic index, 



R e ff ~ 3.7 kpc and Sersic index n = 7.4 (see ICassata et al.l l2010h . the largest size and most concentrated 



light profile in the sample, as well as its stellar mass, Log(M/M Q ) = 11.1, are typical of the bright el- 
liptical galaxies in the local uni verse often observed in groups with estimated total (dark matter) mass 



M ~ 10 13 M Q (|Guo et al.H2009h . Thus, it is likely that the star-formation and/or stellar-mass assembly 



history of this galaxy considerably differ from those of the other five samples galaxies, a fact that might 
reflect in the radial gradients of its stellar population properties. We also note that although the zYJH band 
images show a regular spheroidal morphology out to ?» 2.5", the B-band image reveals that the galaxy has 
a close companion at about 1" corresponding to 8.4 kpc or « 2.3 x R e g, from its center. 

Figure [9] also shows that, regardless of the assumptions on the metallicity gradient, the implied average 
E(B-V)s always has a mild gradient in the sense that the centers (R/R c g < 3.0) of the galaxies have slightly 
higher dust extinction (AE(B - V) ~ 0.05) than the outer regions (3.0 < R/R c ff < 10.0). Both the slope 
and the amplitude of the dust gradient do not depend on the assumed metallicity gradient, implying that a 
mild negative gradient of dust obscuration is very likely a real feature of massive PEGs at z ~ 2, contributing 
at least in part, to the observed color gradients. This is consistent with the finding, discussed in §5\ that the 
slope of color gradient of the individual galaxies correlate with the global E(B-V) value, i.e. the one from 
the best-fit of each galaxies' GUTFIT 12-band photometry to spectral population synthesis models. 

Due to the age-metallicity degeneracy, however, the contribution of an age gradient to the observed 
color gradient is much harder to determine, since it strongly depends on the assumed gradient of metallicity. 
As Figure|9]shows, a flat metallicity gradient results in the outer regions of the galaxies being ~60% younger 
than the center, while if the local metallicity gradient were assumed then the galaxies would have a flat age 
gradient. Finally, in the case of the metallicity gradient predicted be the monolithic collapse model, the 
stellar populations in the outer regions would by ~2 times older than those in the center. 
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Fig. 9. — The dust {top row) and age {bottom row) gradients of massive PEGs under three assumptions of 
metallicity gradients: (1) the fiat gradient {left column), (2) the local gradient {middle column), and (3) the 
monolithic gradient {right column). Galaxies are plotted in different colors and symbols, as their IDs show. 
Horizontal error bars show the size of each annulus, while vertical error bars show the \-a uncertainty of 
each parameter, which is measured through fits on 200 times Monte-Carlo sampled SEDs. In each panel, 
the black solid and black dashed lines show the best-fit value and 1-er uncertainty of the slope of gradient of 
the parameter. The two black points in each panel show the median values for the bins of R/R c fr < 3.0 and 
3.0 < R/R cff < 10.0. 
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Color gradients and internal co lor dispersion of inte r mediate-redshift early-type ga l axies have been 



extens ively studied in the past (e.g. lAbraham et al] I 1999k iMenanteau et al.1 12001 al . 120041) . lAbraham et al. 
(I1999T) studied eleven 11 early-type galaxies at z ~ 0.5 in the Hubble Deep Field (HDF), finding that most 
(7/1 1) have internal color dispersion consistent with being old and coeval, and implying a small ag e gradient 
in the early-type galaxies at higher redshift. Similar properties remain valid at z ~ as well (IWu et al. 



2005). While this is qualitatively consistent with the null age gradient of our galaxies under the assumption 



of local metallicity gradient, in practice a quantitative comparison requires an accuracy in measuring the age 
that we do not have. At z ~ 2 the age of the universe is about 3.2 Gyr, while it is 8.4 Gyr at z ~ 0.5, and 
our finding of ~ 1 Gyr age gradient with flat metallicity gradient means that the fractional age differential 
is ss 30%. This, however, becomes w 12% (or m 7% at z ~ 0) just because the universe has become older. 
In the next section we will discuss the implications of the assumptions on the metallicity gradients for the 
evolution of the galaxies from z ~ 2 to the present. 



8. Discussion 
8.1. Dust Gradient 

8.1.1. Necessity and Robustness 

The mild gradient of dust obscuration, together with its apparent robustness against assumptions on the 
metallicity gradient, that seems to characterize massive early-type galaxies at z ~ 2 is in general agreement 
with the fact that dust obscuration in early-type galaxies in the local universe is not a dominant effect 
in determining their rest-frame UV/Optical color and color gradient. Thus, it appears that the lack of 
a significant presence of dust, or at least of its effects in the UV/Optical rest-frame SED, is a common 
feature of passively-evolving galaxies, regardless of the cosmic epoch when they are observed. Evidently, 
whatever physical mechanism is responsible for the destruction of dust in the aftermath of the cessation of 
star formation in these systems, must act on a significantly shorter time scale than that required to make the 
galaxy's SED become typical of a "red and dead" system. 

Both the inferred dust gradient and the absolute value of dust obscuration are comparatively small, 
AE(B - V)/Alog(R) ~ -0.07 and < E(B - V) >~ 0.1, and, as we have seen, robust against the as- 
sumptions on the metallicity gradient. An important question is whether or not the opposite is also true, 
namely that dust can be neglected when studying the effects of the age and metallicity gradients in the ob- 
served color gradients and their implications on the evolution of the galaxies, both prior and subsequent to 
the epoch of the observations. To answer this question we re-run the fitting procedure in each annulus under 
the two assumptions: (1) zero dust extinction; (2) dust extinction in each annulus fixed to the integrated 
value for the whole galaxy from the best-fit of the 12-band GUTFIT photometry to the models, i.e. no 
E(B-V) gradient. For simplicity, we only consider the case of the local metallicity gradient. 

The results are shown in Figure [T0l where the new derived age gradients {left and right panels) are 
compared with the age gradient derived by letting dust as a free parameter in the fitting {middle). While the 
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Fig. 10. — The age gradient of the sample galaxies under three different assumptions of dust distribution: (1) 
no dust; (2) dust as a free parameter in each annulus; (3) dust fixed to the global value from the best-fit of the 
12-band GUTFIT photometry to the models. Individual galaxies are color and symbol-coded, as labeled . 
The horizontal error bars represent the width of each annulus, while the vertical ones are the 1-er uncertainty 
of each parameter measured from 200 bootstrap Monte-Carlo realizations of the observed SEDs. In each 
panel the black solid and black dashed curves show the best-fit average gradient and l-a interval. The two 
black points in each panel are the median in the two bins R/R e g < 3.0 and 3.0 < R/R c ff < 10.0. The local 
metallicity gradient is assumed throughout. 



individual points vary, albeit within their 1 — a error bars, the average age gradient remains unchanged in 
all three cases, regardless of the assumption on dust obscuration. 

When the dust obscuration is fixed to zero or to the global 12-band value, the fit yields systematically 
larger reduced x 2 than in the case of free dust. This is not just the effect of an extra free parameter in the 
fit: when we compare the best-fit SED models with the observed photometry, the free dust case obviously 
yields better agreement, especially in the B band, the most affected by dust obscuration. So, dust does play 
a role in determining the colors of the galaxies (the dust gradient slope AE(B — V)/Alog(R) ~ —0.07 
means that the rest-frame B-V color becomes on average bluer by 0.07 mag from R e ff to 10 x R c e). The 
absolute value of dust obscuration and its spatial gradient are small enough, however, that uncertainties on 
both these quantities can be neglected when setting constraints to the age and metallicity gradient from the 
observed color gradient, which are important to infer the evolutionary history of the galaxies as we are going 
to discuss in the next section. 



Throughout this study we have used the starburst obscuration law (iCalzetti et all 19941 . 12000D to model 
the effects of dust. While this is appropriate in the case of starburst galaxies, it is now known if it remains 
a good description in the case of the low SSFR, massive galaxies observed at z ~ 2, such as our sample. 
Regardless, however, this choice appears to actually be a conservative one for the imp lied effects of dust i n 
the co lor gradients, as w e directly v erify by repeating our analysis using the Galactic (|Cardelli et al.l ll989). 
LMC (|Fitzpatrickl 1 19861 ) and SMC ((Prevot et al.lll984r) obscuration laws. For simplicity, we only consider 
the case of local metallicity gradient. As Figure [TT] shows, the slope of the dust obscuration gradient is 
reduced to AE(B — V)/Alog(R) < —0.04 when the Galactic and SMC laws are used, and it is close to 
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Fig. 1 1 . — Similar to Figure |9j but showing the results of different extinction laws. Panels from left to 
right show the result of Calzetti Law, Galactic Law, LMC Law and SMC Law. The metallicity gradient is 
assumed to be the local one. 
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zero in the LMC case. 



The color gradient of local e arly-type galaxies is explained in terms of a metallicity gradient (e.g., 



WuetaL 



20051) . As discussed by I Wise & Silval (119961) . however, a dust gr adient can also reproduce the 



observed broadband color gradients in many ellipticals. Using HST images, Ivan Dokkum & Franxl (| 19951) 
found that 48% of 64 early-type galaxies show highly concentrated dust absorptio n at the centers o f the 
galaxies. The siz es of the dust absorption regions are generally smaller than 1 kpc. Rest et al.1 (1200 lb and 
Tran et all (1200 lh found dust fe atures in 29 out of 67 galaxies (43%), including 12 with small nuclear dusty 
disks, while lLauer et all (120051) found central dust in about half of the 77 galaxies that they observed with 
the HST/WFPC2. 

Both an external and internal origin of the dust in the local galaxies have been proposed. The strongest 
evidence for the external origin, in which galaxies obtain their dust from mergers or accretions, is that the 
distribution and motio ns of ionized gas and dust in some locale early-type galaxies seem unre lated to the 
motions of stars (e.g., 



Goudfrooii & de Jond[l995llvan Dokkum & Frarrxlfl995llCaon et allboool) . However, 



Mathews & Brighentil (120031) and lTemi et all (120071) argued that the dynamical infall time from the edge of 
a local early-type galaxy (several ~ 10 8 yrs) is compatible to the time scale of dust destruction due to the 
sputtering by hot X-ray gas (~ 10 8 yrs). Therefore, in the external origin, cold gas and dust should be 
regularly supplied to galaxies in a time scale of ~ 10 8 yrs. But observations of local early-type galaxies do 
not find such evidence. In fact, mergers between early-type galaxies and dusty galaxies are rarely observed 
in local universe. The lack of effective dust resupplies strongly points to the internal orig in, in which dus t is 
produced inside the galaxies by either the mass loss from evolving red giant stars (e.g., 
M-star winds, as discussed in 



Lauer etal 



(2005). 



Temi et al.ll2007l) or 



Understanding the origin of dust in massive PEGs at z ~ 2 requires spatially resolved stellar and gas 
kinematics, which are not available. If they have undergone merger or accretions to some extent in their past, 
dust obtained during these events should settle to the center with dynamical infall time, a few ~ 10 8 yrs. If 
this dust is responsible for the observed obscuration gradients, then this would argue against the existence 
of hot X-ray emitting gas, which would otherwise sputter the dust in a shorter time scale (~ 10 8 yrs). If 
such gas does exist, then a mor e plausible mechanism to form dust gradients is the episodic settling model 
proposed by Lauer et aL I J2005I) to explain the existence and frequency of nuclear dust in local early-type 
galaxies. The model predicts that dust appears several time throughout the galaxy and then is destroyed as 
it falls into the center. Therefore, the existence of hot gas in the massive PEGs at z ~ 2 is a key to judge 
the possible formation mechanisms. Unfortunately, alth ough hot gas is commonly observed in many (or 
most) local massive elliptical galaxies (see the review of LMathews & Brighentill2003r) . no study on hot gas 
has been done for galaxies at z ~ 2, limited by the sensitivity of our current X-ray detectors. 



8.2. Metallicity Gradient 



In the previous sections we have seen that, because of the age-metallicity degeneracy, different as- 
sumptions for the metallicity gradient result in different age gradients, given the observed color gradients. 
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We have also seen that uncertainties on the dust obscuration (both the total amount and its gradient) do 
not affect the quantitative details of the relationship between age and metallicity, given the current uncer- 
tainty. Thus, an interesting question to ask is: given a realistic assumption for the metallicity gradient at 
z ~ 2, namely one that is consistent with the metallicity gradient seen in early-type galaxies z ~ and 
with our current ideas on how metallicity gradients evolve, what is the implied gradient of stellar population 
age? What would such an age gradient tell us about the way z ~ 2 PEGs assembled and their subsequent 
evolution, if they really are the progenitors of the local early-type galaxies? 



8.2.1. Flat Metallicity Gradient 



Under the assumption of a flat metallicity gradient, i.e. metallicity is constant as a function of radius, the 
observed color gradients of the galaxies imply a negative age gradient, namely the age of the stellar popula- 
tions is younger as the radial distance from the center increases, with average gradient Alog(t)/ Alog(R) ~ —0.1. 
Stars located at w 10 x R cff from those in the center are, on average, ~ 1 Gyr younger. The SSFR is also 
higher in the outer regions. For example, the external rings in three of the galaxies, 19389, 22704 and 24626 
have SSFR> 10 -11 /yrs, larger than the global value we use to classify the galaxies as passive. The younger 
stellar populations and larger specific star formation rate in the outskirts of the galaxies could mean a later 
cessation of star formation relative to the center, newer episodes of star formation or accretion of younger 
stellar populations. 

It is interesting, at this purpose, to explore whether the residual star formation in the outskirts can 
explain the apparent si ze evolution of massive passively-evolving ga l axies from z ~ 2 to • z ~ 0, as discussed 



by re cent studies (e.g. JPaddi et al.ll2005l : iTrujillo et al.ll2006l . 120071 : Ivan Dokkum et al.ll2008l : ICassata et al. 
2010). To do so, we simulate a galaxy with Ser sic index n = 2.0 a nd effective radius R e g- = 0.5 kpc 



(typical values for massive PEGs at z ~ 2, see by ICassata et al.1 (|2010)), central SSFR 10 yr and the 



same SSFR gradient as the one in Figure [9] If this galaxy evolves from z = 2 to z = only through 
in-situ star formation, i.e. with no significant accretion of external stars, our calculation shows that it cannot 
evolve into today's typical massive early-type galaxies, which have n = 4 and R c g ~ 2.5kpc, since that 
would require the SSFR in the outskirts to be > 1.5 dex higher than the central one SSFR, a much steeper 
gradient than our observations seem to find. This implies that external mechanisms, such as merger and 
accretion, are necessary to build the extended halos of massive PEGs from z ~ 2 to z ~ 0. We also note 
that the assumption of flat metallicity results in the steepest positive SSFR gradient compared to the local 
metallicity gradient and the monolithic-collapse gradient that we will discuss next. Thus, these two cases, 
too, would imply external mechanisms if the z ~ 2 massive PEG are to evolve into the local early types. 

If merger and accretion do drive the evolution, they must be able to do so in a way that makes the flat 
metallicity gradient evolve into the one observed in local early-type galaxies, Alog(Z)/Alog(R) ~ —0.3, 
while at the same time cancel the negative age gra dient, since the age the stellar populations in local galaxies 
has ve ry little but positive radial dependence (e.g. JTamura & Ohtal2004l : IWu et al.l2005l : lLa Barbera & de Carvalho 
2009). While secular orbit mixing could help explain today's flat age gradient, it seems hard to understand 



how a flat metallicity gradient at z ~ 2 can evolve into the local one if majo r merger drives the evolution, 
since major merger is believed to flatten, not steepen the metallicity gradient (IKobayashill2004l) . 



8.2.2. Local Metallicity Gradient 

If we assume that the z ~ 2 PEGs have the same metallicity gradient as their local counterparts, the 
observed color gradients imply no age gradient, as shown in the middle panel of Figure|9l In other words, the 
radial dependence of metallicity and age of the the stellar populations of the z ~ 2 galaxies is already similar 
to that of their local counterparts. Furthermore, the implied gradient of SSFR is also flat. Thus, if merger or 
accretion drive the evolution to z ~ 0, this must happen in a way that maintains the gradients of metallicity 



and a ge roughly constant in time. Since major merger appears to flatten the metallicity gradient (IKobayashi 



2004), the assumption of the local metallicity gradient would also imply a more gradual a ccretion process as 



the on e responsible for the apparent growth in size of PEG from z ~ 2 to the present (e.g. Ivan Dokkum et al. 



2010). 



8.2.3. Monolithic metallicity gradient 



The monolithic collapse is an idealized model in which a whole worth of stars of a massive galaxy 
form during ~ 1 dynamical time scale. Although a recent monolithic collapse model by lPipino et all (120 ldl) 
that allows certain scatter for the star formation efficiency would produce t he me talli city gradient that agrees 
with the observation of local elliptical galaxies, earlier models by lLarsonl(| 19741) and lCarlbergl (119841) define 



Larson 



a max imu m steepness boun dary in the metallicity gradient slope-mass plane. We discuss models by 
(| 19741) and ICarlbergl (| 19841 ) here simply as the limiting case of a class of assembly mechanisms capable to 
produce the steepest metallicity gradient across the galaxy. 

During the monolithic collapse, stars begin to form everywhere in the collapsing cloud and, once 
formed, remain in their orbits with little net inward motion, while the gas keeps sinking to the center of 
the galaxy due to dissipation. While getting closer to the center, the gas become more and more enriched 
by the rapidly evolving massive stars. Consequently, stars formed in the central regions are more metal rich 
than those formed in the outskirts. Stellar feedback tends to reduce the inflow of gas and hence reduce the 
metallicity gradient. But gas outflows occur earlier and more effectively at large galactocentric distance than 
in the center due to lower escape velocity, lowering the star-formation rate at larger distance and contribute 
to create a strong negative metallicity gradient and a positive age one. 

Under the assumption of the monolithic collapse metallicity gradient, the observed color gradients of 
our sample galaxies indeed imply a positive age gradient such that the stars at R « 10 x R e g in are ~ 0.5 
Gyr older than those in the central regions. We could directly test this prediction of the monolithic collapse 
(or equivalent scenarios), if we were able to independently measure the age gradient, something that is 
not possible with the present data. The monolithic collapse metallicity gradient also implies a weak SSFR 
gradient for our galaxies such that the outer regions at R ^ 10 x R c g have SSFR ~ 0.5 dex lower than in 
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the center. This is qualitative consistent with the general feature of the model that stellar feedback is more 
effective at larger radii than at the center at reducing the star formation activitjH 

If the PEGs observed at z ~ 2 formed through mechanisms similar to monolithic collapse, their subse- 
quent evolution must be such to significantly reduce the magnitude of their metallici ty gradient and to a mi 



nor e xtent the age gradient, since the monolithic-c ollapse gradient is much steeper (Larson 1 1974: 



Peletier et al 



1990bl : lldiart et al.ll2003l : 



Carlber; 



Tamura & Qhta 



Kobayashil (120041) . although the effectiveness of minor 



1984) than that observed in local ellipticals (e.g., 
20031) . Major merger provides such a mechanism 
merger or gas accretion and subsequent star formation in diminishing the steep metallicity gradient is not 
known. 

The monolithic collapse model also predicts that the slope of the metallicity gradient, and hence the 
color gradient, depends on the mass of the galaxies, because a deeper potential well is mor e effective at 



retain ing metals in the center than a shallower one and thus make more metal-rich stars (e.g. jTortora et al. 



2010T) . As shown in Figure [5j the color gradient of our galaxies does not show any obvious dependence on 
the stellar mass, to the extent that this quantity is a good proxy for the galaxies' total mass. The relative 
high scatter in our small sample and the limited stellar mass dynamic range that it probes, however, might 
hide such signal. We will return to the correlation of the color gradient with the galaxies' properties using a 
much larger and significantly deeper sample extracted from the new WFC3 CANDELS survey. 

Independent measures of the age gradient of the stellar populations would also test if mechanisms 
similar to the monolithic collapse play a role in assembling the z ~ 2 PEG, since in this case the stars in 
the central regions would be younger than those in the outskirts. In fact, such an inverted age gradient is 
requested by our data in order to reproduce the observed color gradients if the metallicity gradient of the 
monolithic collapse is assumed, since this would result in significantly steeper color gradients, as we have 
directly verified. 



8.3. Formation of Passively Evolving Galaxies at z~2 

It is more likely, however, that mass ive PEGs at z~2 a re formed through gas-rich major mergers rather 



than a single collapse process. Recently. IWuyts et al.1 (l2Q10f) analyzed SPH simulations of gas-rich mergers 



and their remnants that are treated with radiative transfer. They predicted that quiescent compact galaxies 
at z~2 should typically show red cores and their color gradients should be a superposition of age, dust, 
and metallicity gradients. They found that in the gas-rich merger scenario, stars in the galactic center are 
formed during final coalescence out of more obscured and enriched gas. The dust and metallicity gradients 
compensate the positive age gradient (young center and old outskirts) so that their cores are typical when 
these galaxies are classified as passive systems. They also predicted that the strength of the color gradient 
to be correlated with galaxy's integrated color. All these predictions agree well with our observations and 



2 We note that according to lMartinelli et alJdl998f) constant star formation efficiency with galactocentric distance can also explain 
the observed metallicity gradient and the correlation between colors/metallicity and escape velocity in early-type galaxies. 
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serve as important evidence of the validity of gas-rich major mergers. 



Even if gas-rich major merger is conside red as the f ormat ion mechanism of massive PEGs at z~2, 
their subsequent evolution is still ambiguous. iKobayashil (12004) predicts that gas-rich merger can effec- 
tively flatten the metallicity gradient to the one of local early-types. Therefore, mechanisms that can signif- 
icantly flatten the m etallicity gradient, such as major merger, are not required in the subsequent evolution. 
Wuyts et all (120100 . however, find that the typical metallicity gradient in the simulated z 2 gas-rich merger 
remnants is steeper than the typical metallicity gradient of local early-types. This requires major mergers in 
the subsequent evolution to flatten the metallicity gradient, unless other mechanisms (accretion and minor 
merger) are proved to be capable to flatten the metallicity gradient too. 

Overall, with the current data it is not possible to conclusively rule out or validate any of the three 
cases of metallicity gradients and possible formation mechanisms of massive PEGs at z~2 that we have 
discussed. Passively evolving galaxies appear to undergo su bstantial structural evolution from z ~ 2 to 
z ~ that reduces their compactness and ste llar density (e.g. jDaddi et al.ll2005l : iTrujillo et al JI2006L 120071 : 



van Dokkum et al.ll2008uCassata et al. 2010) If major merging events are the driver of this evolution, to the 
extent that we understand how merger rearranges gradients of metallicity and age, it seems unlikely that the 
z ~ 2 PEG have flat metallicity gradients, since subsequent merger can only keep it flatter, not steepen it to 
an extent required to match the observed one in local ellipticals. Of course, the size evol ution can be driven 



by les s dramatic minor merging events or continuous accretion, as some have suggested ( van Dokkum et al. 



2010l) . We do not know what these mechanisms would imply for the evolution of the metallicity and age 
gradients compatible with the color gradients observed at z ~ 2 if they have to evolve into those observed 
at z ~ 0. Finally, we remind that our discussion is based on resolved photometry that only covers the 
UV/Optical rest frame. Future high-resolution observations with JWST extending the wavelength baseline 
to the near and mid-IR will allow us to considerably reduce the extent of the age-metallicity degeneracy, 
and help us constraint a self-consistent evolutionary scenario for the assembly of the z ~ 2 PEG, as well as 
their subsequent evolution. 



9. Summary 

We have discussed the implications of the detection of color gradients in early-type galaxies at z ~ 2 
from deep high-angular resolution images at optical and near-IR wavelengths obtained with HST and the 
ACS and WFC3 cameras. 

In particular, we have measured resolved rest-frame UV-optical colors of a sample of six massive 
(> 10 10 M Q ) and passively evolving (SSFR< 10 _11 yr -1 ) galaxies at 1.32 < z < 2.42. After defining 
for each galaxy a set of concentric apertures that optimally sample the observed gradient of colors, we 
have carried out fits to spectral population synthesis models using the available seven-band (BVizYJH) 
photometry to derive how dust obscuration (E(B-V)), mean age, specific star formation rate (SSFR), and 
stellar mass (M sta r) vary with the galactocentric distance. We have then used this information to discuss 
possible evolutionary scenarios for these galaxies in light of recent results on the apparent evolution of 
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their morphological evolution and on theoretical expectations on how merger modifies existing gradients of 
metallicity and stellar age. This paper can be summarized as follows: 

1. Color gradients could be measured over scales that typically go up to « 10 x R e g, where R c g- is the 
effective radius of the Sersic profile. The HST images show that the inner regions of these galaxies 
have redder rest-frame UV-optical colors (U-V, U-B and B-V) than their outer parts. 

2. The slopes of the color gradients have no dependence on the redshift and stellar mass of the galaxies. 
However, they have a mild dependence on the global dust extinction and rest-frame U-V color of the 
galaxies. Galaxies with larger E(B-V) or redder U-V color tend to have steeper color gradients. 

3. The slopes of the color gradients of these galaxies are generally steeper than that of local early-type 
galaxies. 

4. We investigate whether the variation of a single parameter (age, extinction, or metallicity) along radius 
can be used to explain the observed color gradients. Using the single stellar population model, we find 
that the variation of any single parameter cannot simultaneously fit the three observed color gradients 
(U-B, U-V and B-V) with the maximum likelihood. We conclude that the observed color gradients of 
massive PEGs at z ~ 2 cannot be explained by a single gradient of age, extinction or metallicity and 
should be originated from an interplay of gradients of the three parameters. 

5. The fits of spatially resolved stellar populations to the spectral population synthesis models are run un- 
der three assumptions of metallicity gradients: (1) a flat metallicity gradient (Alog(Z) / Alog(R) = 0), 
(2) the metallicity gradient of local early-type galaxies (Alog(Z)/Alog(R) = 0.25), and (3) the gra- 
dient predicted by the monolithic collapse (Alog(Z)/Alog(R) = 0.5). 

6. Regardless of the assumptions on metallicity, a modest gradient of dust obscuration is always implied 
from the fits in the sense that the central regions of the galaxies have slightly higher dust obscuration 
than the outer parts , with an ave r age g r adient of AE(B — V)/Alog(R) ~ —0.07, if the starburst 



(MW, SMC, LMC) result in smaller obscuration gradients. Overall, both the absolute value of dust 
obscuration and its gradient are small, however, consistently with the present-day early-type galaxies, 
where dust generally has small, if any effects on the observed colors. It appears that once a galaxy 
has become passive, for whatever physical mechanisms, dust obscuration ceases to play a significant 
role in the determining the UV/Optical SED. 

7. While dust obscuration contributes in small measure to the observed color gradients of the z ~ 2 
galaxies, its presence does not seem to affect the general age-metallicity degeneracy in the sense that 
the implied gradient of age derived from a given assumption for the gradient of metallicity does not 
depend on how dust is treated, i.e. if forced to a fixed value or left as a free parameter in each annulus, 
or on the adopted extinction law. Whatever inference on the age or on the metallicity gradient is made, 
after assuming one or the other parameter, does not seem to appreciably depend on the assumption on 
dust obscuration. 




Other extinction laws that we have tested 
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Due to the age-metallicity degeneracy, the derived age gradients are strongly coupled with the as- 
sumed metallicity gradients: (1) assuming a flat metallicity gradient, the outer regions of the galaxies 
are younger than the inner regions with a age gradient of Alog(t)/Alog(R) ~ —0.1; (2) assuming the 
metallicity gradient observed in local early-type galaxies, the stellar populations in the outer regions 
have same age as those in the inner regions; and (3) for the metallicity gradients predicted by the 
monolithic collapse, the outer regions are older than the inner regions, with the average age gradient 
Alog(t)/Alog(R) ~ 0.15. Their specific star-formation rate is also ~0.5 dex lower than that in the 
inner regions. 

The mass-size (or equivalently mass-stellar density) relationship of the z ~ 2 galaxies cannot 
evolve into the local one only through in-situ star formation driven by the small observed star- 
formati on activity (SSFR< 10~ n yr _1 or less). This implies the accretion of stellar mass from 
outside (|van Dokkum et al.ll2010h . 



10. Overall, with the current data it is not possible to conclusively rule out or validate any of the three 
cases of metallicity gradients that we have considered. A major source of uncertainty is the fact that 
major merger rearranges the gradients of metallicity and age on a short time scale, while less dramatic 
events such as minor merger or a more continuous accretion might induce a more "secular" evolution 
of these properties. Passively evolving galaxies appear to undergo substantial structural evolution 
from z ~ 2 to z ~ that red uces their compac t ness by a factor of 3-5 and their stellar density by 



2 orders of magnitude (e.g., baddi et al.ll2005l : iTrujillo et al.ll2006l 120071 : Ivan Dokkum et al.ll2008 



Cassata et al.ll2010h If major merging events are the driver of this evolution, then, to the extent that 
we understand how merer rearranges gradients of metallicity and age, it seems unlikely that the z ~ 2 
PEG have flat metallicity gradients, since subsequent merger can only keep it flatter, not steepen it 
to an extent required to match the observed one in local ellipti cals. Of course, the size evolution can 
be driven by minor merger/accretion, as some have suggested (Ivan Dokkum et al.ll2010h . In this case, 
we have much less guidance in inferring which metallicity and age gradients are compatible with the 
color gradients observed at z ~ 2 if they have to evolve into those observed at z ~ 0. 

11. While it is possible that the subsequent evolution reconciles the metallicity and age gradient emerging 
from the monolithic collapse to those observed at z ~ 0, the observations do not seem to show any 
correlation between the strength of the color gradient and the stellar mass, which is predicted if the 
z ~ 2 PEG formed through such a mechanism. The inherent statistical noise in a sample as small as 
ours, and the fact that the sample itself only covers a small dynamic range in mass, can very well hide 
any such correlation. We do observe, however, a correlation between the color gradient and the dust 
obscuration (E(B-V)), even if such parameter is generally much less accurately estimated with broad- 
band SED fitting than the stellar mass, which is the most accurate one. This seems to support the lack 
of a correlation between the color gradient and the stellar mass, and thus argue against the monolithic 
collapse, or any formation mechanism capable to produce an equally steep metallicity gradient, as 
responsible for the formation of the z ~ 2 PEG. We will return on this subject using substantially 
large samples of such sources from the CANDELS project. 
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12. The metallicity gradient of the galaxies could be either close to that of the local early-type galaxies or 
flat. In the first case, the subsequent evolution must be such to preserve the metallicity gradient, which 
would seem to rule out major merger. In the second case, the evolution must create the gradient. This 
also seems to rule out major merger, since it can only flatten, not steepen, the gradient. 
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